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Abstract 

Citrullination, a posttranslational modification of peptidyl arginine to citrulline, plays an essential role in rheumatoid arthritis 
(RA). Citrullination is catalyzed by a group of peptidylarginine deiminases (PADs) including PADI 1, 2, 3, 4 and 6. Many 
Studies have indicated that the gene encoding PADI4 is a factor in susceptibility to RA. Some studies have detected PADI2 
expression in RA synovial tissues, suggesting that PADI2 also plays an important role in the disease. This study evaluated the 
possible association between the PADI2-encoding gene and RA. Seventeen tag SNPs across the PAD locus were genotyped 
using a custom-designed lllumina 96-SNP VeraCode microarray. Peripheral blood samples were collected from patients with 
RA (n = 267), ankylosing spondylitis (AS, n = 51) and healthy controls (n = 160). The results of genotyping were verified using 
Sequenom IVlassARRAY in an independent cohort of 307 patients with RA, 324 patients with AS and 509 healthy controls. A 
western blot analysis was performed using synovial tissue from patients with RA (n = 7), osteoarthritis (OA, n = 7) and AS 
(n = 5) to determine the levels of expression of PADI2. A microarray analysis revealed a significant association between three 
selected PADI2 SNPs (rs2235926, rs2057094, rs2076616) and the presence of RA. The increased susceptibility to RA 
associated with rs2235926 (OR = 1.706733, 95% CI = [1.576366-1.866587], p = 0.000839) and rs2057094 (OR = 1.360432, 95% 
CI = [1.065483-1.869482], p = 0.003291) was further confirmed by the Sequenom MassARRAY. No tag SNPs in the PADI2 
locus showed a significant association with AS. Increased expression of PADI2 was detected in RA synovial tissues compared 
with samples from patients with OA and AS. PADI2 is significantly associated with RA and may be involved in the 
pathogenesis of the disease. 
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Introduction 

Citrullination is a posttranslational modification involving the 
conversion of arginine residues into the amino acid citrulline. This 
modification afiects physiology either by directly modulating 
protein function or by impacting immune recognition of self 
proteins. Rheumatoid arthritis (RA) is an autoimmune disease, 
and serum from RA patients contains a spectrum of autoantibod- 
ies including rheumatoid factor, anti-filaggrin autoantibody, anti- 
keratin antibody (AKA), anti-perinuclear factor, anti-vrmentin and 
anti-cyclic citruUinated peptide antibody (anti-CCP) [1-5]. The B- 
cell epitopes of most RA autoantibodies contain citrulline. 
Antibodies directed again.st citrulline-containing proteins are 
highly specific for RA and can be detected in up to 80% of RA 
patients. Thus, citrullination plays an essential role in the 
autoimmune basis of RA [6]. 

Citrullination is catalyzed by a group of calcium-dependent 
peptidylarginine deiminase (PAD) enzymes. Five mammalian PAD 
family members (PAD or PADI 1-4 and 6), all encoded by a 
cluster of genes on chromosome lp36, have been described and 
show tissue-specific distribution in most body tissues. Over the past 
decade, PAD and protein citrullination have been commonly 
implicated as abnormal pathological features in the inflammatory 



response. The majority of our knowledge regarding the disease- 
related mechanisms of uncontrolled citrullination and anti- 
citruUinated protein antibody (ACPA) development in RA is 
focused on PADI4. Recent studies have indicated that polymor- 
phisms of the PADI4 gene confer susceptibility to RA in people of 
East Asian descent. Case-control association studies and mRNA 
stability assays indicate a strong association between the PADI4 
gene and RA in the Japanese, Korean and Chinese populations. 
However, studies in European populations have produced 
conflicting results. A weak association or no association was found 
in Caucasian populations, including Spanish, Tunisian, British, 
white Hungarian, French, German, North American and Swedish 
populations [7-26]. Moreover, many studies failed to find any link 
between the PADI4 genotype and the presence of anti-CCP 
antibodies, rheumatoid factor or erosions in people with RA 
[19,23,27-29]. 

To investigate how the PAD gene and its expression are 
involved in the RA pathogenic process, some studies have 
investigated the expression and activity of other PAD isotypes in 
the peripheral blood and synovial fluid cells of patients with RA. 
Vossenaar et al. investigated the expression and activity of four 
isotypes of PAD in the peripheral blood and synovial fluid cells of 
patients with RA. They detected that transcription of PADI2 and 
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PADI4 mRNA predominated in peripheral blood monocytes. 
However, PADI4 mRNA was not detectable in the macrophages 
that were abundant in the inflamed RA synovium. They also 
found that PADI2 expression was closely linked with inflammation 
in RA synovial tissue and that PADI2 and citruUinated proteins 
were present in the synovial fluid of RA patients [30] . Vossenaar 
et al. also examined PAD expression in synovial tissues from mice 
with collagen-induced arthritis and mice with streptococcal cell 
wall-induced arthritis. In both mouse models, PADI2 mRNA was 
present in the synovium but was not translated into PADI2 
protein. In contrast, PADI4 mRNA, although absent from healthy 
synovium, was readily transcribed and translated by polymorpho- 
nuclear neutrophils infiltrating the synovial tissue during inflam- 
mation [31]. We also detected PADI4 expression in RA synovial 
membrane and synovial fluid in previous studies [32,33]. The 
above results demonstrate that PADI2 and PADI4 are the most 
likely candidates among the PAD isotypes for the citruUination of 
synovial proteins in RA. Although, among the four isoforms of 
PAD, only PADI4 has been identified as an RA-susceptibility 
gene, it is possible that PADI2 and other PAD members are also 
genetic factors for RA and are involved in the pathogenic process. 

Our study aimed to investigate the association between potential 
single nucleotide polymorphisms (SNPs) in the entire PAD locus 
(including the PADIl, PADI2, PADI3, PADI4 and PADI6 genes) 
and RA risk. We also investigated PADI2 expression in RA 
synovial tissues. Samples collected from patients with ankylosing 
spondylitis (AS) or osteoarthritis (OA) were used as controls. 



Results 

Genotyping SNPs Located in the PAD Locus 

Seventeen SNPs across the gene were genotyped using a 
custom-designed lUumina microarray. AU of the SNPs yielded 
genotypic data, and the study sample success rate was 98.1%. All 
of the SNPs except rs6695849 were found to be in Hardy- 
Weinberg equilibrium (p>0.05) within the healthy samples. The 
dilferences in allele and genotype frequencies between the cases 
and controls were compared. The frequencies of the rs2235926, 
rs2057094, rs2076616 and rsl 1203368 alleles showed a significant 
association with RA (p = 0.012818, 0.016166, 0.045123 and 
0.000229, respectively). The genotype frequencies of the four 
SNPs were also significantly associated with the disease 
(p = 0.028584, 0.037656, 0.043905 and 0.00063, respectively). 
Following the Bonferroni correction, the SNPs rs2235926, 
rs2057094 and rsl 1203368 still had a significant association with 
RA with regards to allele frequency and genotype frequency, 
indicating that these three SNPs in the PADI2 and PADI4 
encoding genes are significandy associated with the disease. We 
also investigated the potential association between PAD polymor- 
phisms and AS. The lUumina microarray detected a significant 
association between rsl635564, rsl204898 and rsl0788668 and 
AS with regards to allele frequencies (p = 0.007332, 0.016945 and 
0.042589, respectively). The genotype frequency of rsl 635564 also 
demonstrated a significant association with AS (p = 0.032555). 
Following the Bonferroni correction, the SNP rsl 635564 in the 
PADI4 region still had a significant association with AS with 
regards to allele frequency and genotype frequency. The lUumina 
microarray results are shown in Table 1 and Table 2. The 
microarray results were submitted to the NCBI Gene Expression 
Omnibus (GEO), and the record was approved and assigned GEO 
accession number GSE39428. 

To evaluate the extent of linkage disequilibrium (LD), the D' 
was calculated between all possible pairs of polymorphisms. An 
LD analysis defined two blocks in the PAD locus in the RA 
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population. One block was comprised of rs2235926, rs2057094 
and rs2076616, with a mean pair-wise D' of 0.923. Another block 
was comprised of rs6683201, rsl635564, rsl635561, rsl2563231, 
rsl204898, rsl0788668, rs2526839, rs6695849 and rs7538876 
with a mean pair-wise D' of 0.923. The results are presented in 
Figure 1. 

An additional genotyping experiment was performed in a larger 
cohort of RA patients and AS patients using the Sequenom 
MassARRAY. The samples used in this study were independent of 
those used in the above microarray. The allele and genotype 
frequencies of all five SNPs did not deviate from Hardy-Weinberg 
equilibrium in the cases or controls. The allele and genotype 
frequencies for SNP rs2235926 [p = 0.000839 and 0.005134, 
respectively; Odds Ratio (95% CI) = 1.706733 (1.576366- 
1.866587) ] had a statistically significant association with RA. 
The allele and genotype frequencies for SNP rs2076616 
[p = 0.003291 and 0.015558, respectively; Odds Ratio (95% 
CI) = 1.360432 (1.107805-1.670670) ] also showed a significant 
association with RA. The other three SNPs were not significandy 
associated with RA. No SNP had a statistically significant 
association with AS. The results are shown in Table 3. Following 
multiple-test correction, these SNPS still showed a significant 
difference in allelic frequency and geiiotypic frequency between 
the RA group and the controls. This result is in accordance with 
the results of the lUumina microarray. 

We investigated the possible association between RF, anti-CCP 
antibody levels and the DNA polymorphisms of the 1 7 SNPs. No 
significant association was obtained between these PAD variants 
and RF or anti-CCP levels in patients with RA. 

PADI2 Expression in Synovial Tissue from RA, AS and OA 
Patients 

A western blot analysis was used to evaluate PADI2 expression 
in synovial tissues from RA, OA and AS patients. The expression 
levels were determined by normalizing their expression to 
GAPDH (37 kDa). The expression of PADI2 (75 kDa) was 
significandy increased in samples from RA patients compared to 
samples fi-om patients with OA (p = 0.007) and AS (p = 0.039). 
The expression of PADI2 was not significantly different in samples 
from patients with AS compared to samples from patients with 
OA. The results are shown in Figure 2. 

Discussion 

In our study, we used a custom-designed lUumina microarray to 
genotype 17 tag SNPs in the PAD locus to determine their 
association with susceptibility to RA. These 17 SNPs span the 
entire PAD region, including PADIl, PADI2, PADI3, PADI4 and 
PADI6. SNPs including rs2235926, rs2057094 and rs2076616 in 
the PADI2 locus showed a significant association with RA. We 
also detected a significant association between rs 11203368 and 
RA. Because rsl 1203368 is located in the PADI4-encoding gene, 
our findings not only support previous finding regarding the 
association between PADI4 and RA susceptibility but also confirm 
the reliability of our genotyping results. The experiment demon- 
strated that other members of the PAD gene family are not 
associated with RA. To verify the above analysis, the Sequenom 
MassARRAY was used with a larger and independent cohort of 
RA patients. The study provided evidence that the SNPs 
rs2235926 and rs2076616 are significantly associated with RA. 
The Sequenom MassARRAY did not detect a significant 
association between SNPs in the PAD locus and AS, indicating 
a particular association between PADI2 and RA. Thus, both the 
microarray and Sequenom analyses provide evidence that PADI2 
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Figure 1. A linkage disequilibrium plot of the genotyped SNPs in the PAD region of in the RA group. Red areas represent higher levels 
of LD. Dark triangles represent haplotype blocks. Site 1-17 repsent rs2235926, rs2057094, rs2076616, rs10788656, rs1005753, rs2977310, rsl886301, 
rsl 1203368, rs6683201, rs1635564, rs1635561, rs12563231, rs1204898, rs10788668, rs2526839, rs6695849 and rs7538876, respectively. 
doi:1 0.1 371/journal.pone.0081 259.g001 



is associated with the risk of developing RA in the Northern 
Chinese population. 

An LD analysis defined two blocks in the PAD locus among the 
RA population. One block was comprised of rs2235926, 
rs2057094 and rs2076616 in the PADI2 gene with a mean pair- 
wise D' of 0.923; the other block was comprised of all other SNPs 
in PADI4 and PADI6. The results indicate that the strong 
association between PADI2 and RA is not a reflection of the 
association between PADI4 and the disease. 

Too et al. genotyped 320 SNPs from the PADI locus (including 
PADIl, PADI2, PADI3, PADI4 and PADI6 genes) in 1,238 RA 
cases and 1,571 control subjects from Malaysia in a case-control 
study. They also conducted a meta-analysis including previously 
published RA data from the East Asian population. A haplotype 
analysis revealed four PADI4 SNPs with a significant association 
with RA. In addition, they detected a novel association between 
the PADI2 genetic variant rsl 005753 and RA [34]. Lee et al. 
conducted a meta-analysis using 4,429 SNPs in 1,527 cases of RA 
and 3,421 controls in a Korean RA GWAS dataset. An ICSN 
Pathway analysis (identilying candidate causal SNPs and path- 
ways) identified three candidates of causal non-HLA SNPs and 
four candidates of causal pathways involving the PADI4, MTR, 
PADI2 and TPH2 genes in RA [35]. Although our results showed 
that rsl005753 was not associated with RA, the genotyping data 
demonstrated that other PADI2 genetic variants have a significant 
association with RA, which is in accordance with the results of the 
above two meta-analyses. Our results and the meta-analyses 
performed by others suggest that polymorphisms in the PADI2 
gene are associated with risk of RA in the Asian population. 

Many studies have investigated the pathogenic role and 
regulatory mechanism of PADI2 in RA. Arandjelovic et al. 
identified PADI2 expression in mast cells from RA synovial fluid. 



They also found that activation of the P2X7 purinergic receptor 
(P2X7) induces PADI2 activity and robust protein citruUination. 
P2X7-mediated activation of PADI2 is sensitive to p38 MAPK 
and protein kinase C inhibitors, and PADI2 regulates the 
expression of TNFR2, ADAMTS9 and Rab6b transcripts in mast 
cells [36] . Within RA synovial tissues, the activation of macro- 
phages and fibroblasts mediated by T-ceU contact or driven by 
cytokines plays a prominent part in RA pathogenesis. Ferrari- 
Lacraz et al. found that PADI2 and PADI4 mRNA and proteins 
were transiently up-regulated in monocytes after contact with T- 
lymphocytes. Stimulation with IL-ip or IFN-fi did not modify 
levels of PADI2 and PADI4 mRNA, but did enhance PADI4 
protein expression. No mRNA or protein of any PAD isotype was 
detected in resting or stimulated synovial fibroblasts. Thus, they 
suggested that contact between stimulated T cells and monocytes/ 
macrophages or cytokine-activated monocytes — /macrophages 
constitutes a highly likely source of PADI2 and PADI4, which 
are observed in inflamed synovial tissues [37]. Calabrese et al. 
conducted a methylation pattern analysis and detected a CpG 
island in the PADI2 promoter. They showed that the over- 
expression of PADI2 was associated with promoter demethylation 
[38]. Lee et al. examined PAD-mediated citruUination and its 
effect on pro-inflammatory activity. They found that PADI2 
overexpression reduced NF-kB activity [39]. Our study detected 
increased PADI2 expression in synovial tissues from RA patients 
compared with samples from OA patients and AS patients. These 
previous studies suggest that PADI2 and its citruUination play an 
important role in RA autoimmunity. Nakayama-Hamada et al. 
suggested that PADI2 and PADI4 have different roles in different 
physiological and pathological conditions [40]. 

Bodnar et al. detected high levels of anti-mutated citruUinated 
vimentin (anti-MCV) in the serum of AS patients [41]. Bay-Jensen 
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RA. W e previously detected the expression of PADI4 in synovial 
membranes and synovial fluid of AS patients, although the levels 
were lower than those in tissues from patients with RA [33]. The 
involvement of PADI4 in the pathogenesis of AS warrants further 
study. 

In summary, our study detected a significant association 
between polymorphisms in the PADI2 gene and RA. We also 
detected increased PADI2 expression in RA synovial tissues. 
Further studies are needed to better understand the role of PADI2 
in the initiation and progression of RA. 
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Figure 2. Semi-quantitative western blot analysis of PADI2 
proteins in RA (A), OA (B) and AS (C) synovial tissues. The loaded 
tissue extracts were probed with an antibody to GAPDH to normalize 
the loading volume (D). 
doi:10.1371/journal.pone.0081259.g002 

et al. detected high levels of matrix metalloproteinase-degraded 
fragments of vimentin in the serum of AS patients [42]. Their 
studies suggest that citruUination may be relevant in the 
pathogenesis of AS. Chen et al. investigated the PADI4 polymor- 
phisms in AS in the Chinese Han population. Their study was 
conducted in a total of 3 1 6 Chinese patients of Han nationality 
with AS and 439 healthy controls. They genotyped 5 SNPs 
including rsl 1203366, rsl 1203367, rs874881, rs2240340 and 
rsl 748033 in the PADI4 gene. No significant differences with 
regards to the frequency of alleles and genotypes were found when 
comparing the cases and controls [43] . The author thus suggested 
that PADI4 polymorphisms may not play an important role in the 
development of AS in the Chinese Han population. In this study, 
we found that rsl 635564 in PADI4 is significantly associated with 



Materials and Methods 

Sample Preparation 

Peripheral blood samples were collected from patients with RA 
or AS. RA was diagnosed according to the criteria of the American 
College of Rheumatology. RA patients had high levels of C- 
reactive protein (30—100 mg/liter, mean 24 mg/liter), anti-CCP 
(300-3,000 U/ml) and rheumatoid factor (RF) (160-2,560 U/ml). 
The diagnosis of AS patients was consistent with the modified New 
York criteria for AS. The patients with AS were positive for the 
HIA-B27 antigen. The healthy individuals were blood donors 
without a personal or family history of serious illness. Patients and 
healthy controls were selected from the same population living in 
the Shandong area of Northern China. The blood samples were 
collected in Monovette tubes containing 3.8% sodium citrate. 

Synovial tissue samples were collected during knee joint 
replacement surgery in patients with RA (n = 7; 4 females; range, 
33-68 years old; mean, 53 years old). RA patients had disease 
durations of 3—9 years and were classified as having erosive RA 
(Larsen class IV-V). Synovial tissues were also collected during 
joint replacement surgery for patients with OA (n= 7; 3 females; 
age range 40-72 years old; mean, 60 years old). The participants 
symptoms fulfilled the modified New York criteria. Synovial tissue 
samples from patients with AS (n = 5, 2 female, 28-54 years old, 
mean 35) were collected during hip joint replacement surgery. AS 
patients had an average disease duration of seven years and were 
positive for HLA-B27 antigen. Patients with AS and RA took 
disease-modifying antirheumatic drugs (DMARDs) before surgery. 
Patients with AS, RA and OA were also medicated with non- 
steroidal anti-inflammatory drugs (NSAIDs). Thus, the medical 
pretreatment did not influence the results, and the experimental 
results are comparable between groups. Synovial samples were 
dissected from the connective tissues and immediately stored at 
— 80°C for later use. 

Both patients and healthy controls gave their written consent to 
participate in the study and allowed their biological samples to be 
genetically analyzed. The Ethical Committee of Shandong 
Provincial Qjanfoshan Hospital approved this study. 

Genomic DNA Isolation, 5NP Selection and Genotyping 

Genomic DNA was extracted from whole blood samples using 
the Omega E-Z 96 Blood DNA kit (Omega, USA) according to 
the manufacturer's protocol. After extraction, the genomic DNA 
was diluted to a final concentration of 15-20 ng/)a.l for use in the 
genotyping assays. 

Tag single nucleotide polymorphisms (tag SNPs) across the PAD 
locus were identified by searching the HapMap database [44,45] . 
Only SNPs with a minor allele frequency (MAF) greater than 5% 
and a pair-wise r^sO.8 were considered. Candidate SNPs were 
submitted to lUumina for a design score. Finally, seventeen SNPs 
were selected, as described in Table 1. 

The seventeen SNPs in Table 1 were genotyped using a 
custom-designed lUumina 96-SNP VeraCode microarray. 
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Peripheral blood samples were collected from patients with RA 
(n = 267, 183 females) and AS (n = 51, 10 females). RA patients 
had a mean age of 51.7 years, while AS patients had a mean age of 
35.9 years. A total of 160 (58 females) healthy individuals with a 
mean age of 48.0 years were blood donors. The work was 
completed at Beijing Yimei Tongdi;, a China-based company, 
which provided technical services for the genotyping. 

The microarray results were verified with an allele-specific 
MALDI-TOF mass spectrometry assay (Sequenom MassAR- 
RAY). Five SNPs, including rs2235926, rs2057094, rs2076616, 
rsl204898 and rsl0788668, were genotyped in a cohort of 307 
patients with RA (252 females), 324 patients with AS (45 females) 
and 509 healthy controls (194 females), none of whom were 
included in the experiments utilizing the Illumina microarray. The 
RA patients had a mean age of 51.43 years, the AS patients had a 
mean age of 28.88 years, and the healthy individuals had a mean 
age of 62.88 years. The polymorphism-spanning fragments were 
amplified by the polymerase [:hain reaction (PGR), and genotyping 
was performed using the Sequenom MassARRAY iPLEX 
platform. Primers for the amplification and extension reactions 
were designed using MassARRAY Assay Design version 3.1 
software (Sequenom, San Diego, CA). The SNP genotypes were 
obtained according to the protocol provided by the manufacturer. 
The work was completed at Bioyoung Tech, a China-based 
company, which also provided technical services for the genotyp- 
ing. 

The genotyping quality was evaluated by a detailed quality 
control procedure consisting of a >95% success identification rate, 
duplicate identification of the genotypes, internal positive control 
samples and testing for the Hardy-Weinberg Equilibrium (HWE). 
The SNPs were analyzed for association by comparing the MAP 
between the cases and controls. Dominant and recessive models 
were considered with respect to the minor allele. Associations 
between the SNPs and the diseases were evaluated using odds 
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